J. Biochem. 148, 525-536 (2008) 
doi:10.1093/jb/mvm246 


Residues on the Dimer Interface of SARS Coronavirus 3C-like 
Protease: Dimer Stability Characterization and Enzyme 


Catalytic Activity Analysis 


Shuai Chen, Jian Zhang, Tiancen Hu, Kaixian Chen, Hualiang Jiang’ and Xu Shen* 


Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia 
Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China 


Received August 26, 2007; accepted December 18, 2007; published online January 7, 2008 


3C-like protease (8CL?*°®) plays pivotal roles in the life cycle of severe acute 
respiratory syndrome coronavirus (SARS-CoV) and only the dimeric protease is 
proposed as the functional form. Guided by the crystal structure and molecular 
dynamics simulations, we performed systematic mutation analyses to identify 
residues critical for 3CLP”° dimerization and activity in this study. Seven residues 
on the dimer interface were selected for evaluating their contributions to dimer 
stability and catalytic activity by biophysical and biochemical methods. These 
residues are involved in dimerization through hydrogen bonding and broadly 
located in the N-terminal finger, the a-helix A’ of domain I, and the oxyanion loop 
near the S1 substrate-binding subsite in domain II. We revealed that all seven single 
mutated proteases still have the dimeric species but the monomer-dimer equilibria 
of these mutants vary from each other, implying that these residues might contribute 
differently to the dimer stability. Such a conclusion could be further verified by the 
results that the proteolytic activities of these mutants also decrease to varying 
degrees. The present study would help us better understand the dimerization- 
activity relationship of SARS-CoV 3CL?*® and afford potential information for 
designing anti-viral compounds targeting the dimer interface of the protease. 


Key words: catalytic mechanism, dimerization-activity relationship, dimer interface, 
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Abbreviations: 3Cl°"°, 3C-like protease; CD, circular dichroism; Dabcyl, 4-[[4-(dimethylamino) phenyl] azo] 
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energy transfer; SARS-CoV, 
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The disease of severe acute respiratory syndrome (SARS) 
broke out in China and menaced to more than 30 other 
countries from the end of 2002 to June 2003. SARS 
coronavirus (SARS-CoV) was identified as the etiological 
agent responsible for this infection (1, 2). SARS-CoV 
involves the largest viral RNA genome known to date, 
encompassing 29,727 nucleotides predicted to contain 14 
functional open reading frames (ORFs) (3). Two large 
5/-terminal ORFs, la and 1b, encode two overlapping 
polyproteins, ppla (around 450kDa) and pplab (around 
750 kDa) necessary for viral RNA synthesis. Polyproteins 
ppla and pplab are cleaved extensively by 3C-like 
protease (8CL?™) and a papain-like cysteine protease 
(PL2?*°) to yield a multi-subunits protein complex 
called ‘viral replicase-transcriptase’ (4). Considering its 
functional indispensability in coronavirus life cycle, 
SARS-CoV 3CL°*° has become an attractive target in 
discovering new anti-SARS agents (5). 
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The crystal structure revealed that SARS-CoV 3CL?”° 
can form a dimer with the two monomers oriented 
perpendicular to one another (Fig. 1A and B) (6). Each 
monomer contains three domains: domains I (residues 
8-101) and II (residues 102-184) have six-stranded 
antiparallel B-barrel forming a chymotrypsin fold, the 
substrate-binding pocket is located in a cleft between 
these two domains, while domain III (residues 201-308) 
is an antiparallel globular cluster of five «a-helices 
connecting to domain II by a long loop region (residues 
185-200). The 16-residue loop region has been implicated 
to mediate the substrate-binding (7). Based on the crystal 
structure, the N-terminal finger (residues 1-7) might 
play an important role in both dimerization and enzy- 
matic activity of SARS-CoV 3CL?*°. Numerous reports 
have proven that the N-terminal finger contributes well 
to dimerization of SARS-CoV 3CL?*° (8-10). In addition, 
domain III has also been revealed to extensively involve 
in monomer—monomer interactions (7, 11). Furthermore, 
Hsu et al. (9) reported that the residue Arg4 and the last 
C-terminal helix (residues 293-306) are critical for 
stabilizing the dimer structure to maintain a correct 
conformation of the active site. As the crystal structures 
of different CoV 3CL proteases give similar dimeric 
structures and nearly all residues of 3CL?*° involved in 
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Fig. 1. The dimeric structure of SARS-CoV 3CL?’° and 
extensive residue-residue interactions on the dimer 
interface. (A) A ribbon diagram for the crystal structure of 
SARS-CoV 8CL?*° (PDB: 1UK2). Monomer A and B are 
represented as black and grey, respectively and the three 
domains are also labelled. The residues involved in monomer— 
monomer interactions, which were selected for subsequent single 


dimerization are conserved, it has been indicated that 
only the dimer is the biological functional form of SARS- 
CoV 3CL?”° (12, 13). Tan et al.(14) revealed that the low 
enzymatic activity of the dissociated monomer is mainly 
due to the collapse of the oxyanion hole in the Sl 
substrate-binding subsite. Since the dissociated monomer 
might be inactive, the dimer interface has been sug- 
gested as a potential target for rational inhibitors design 
against SARS-CoV 3CL?”° (7, 15). An octapeptide inter- 
face inhibitor, designed according to the sequence of the 
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point mutation analyses, are shown in the bond model. The 
binding peptide substrate (MP) is also shown as the stick model. 
(B) A surface model of the protease. The two monomers are in the 
same orientation as shown in panel A. (C) The dimer interface 
between monomer A and B. The bonds and residues belonging to 
monomer A or B are labelled respectively. 


N-terminal finger, was found to bind to SARS-CoV 
3CL?"° specifically and competitively (16). 

The crystal structure (6) and molecular dynamics 
calculations (14) revealed that the dimer interface of 
SARS-CoV 3CL?° mainly consists of the interactions 
between two helical domains III of each monomer, and 
the hydrogen bonding and electrostatic interactions 
between the N-terminal finger of one monomer and 
the residues near S1 _ substrate-binding subsite of 
the other monomer, in particular an oxyanion loop 
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Fig. 2. Residue-residue interactions. (A) Residue—residue 
interactions between the N-terminal finger and the S1 
subsite in the substrate-binding pocket of SARS-CoV 3CL?”. 
(B) Residue—residue interactions between two o—helix A’ (resi- 
dues 10-15) of domain I in SARS-CoV 8CL?"® dimer. The 
residues belonging to monomer A or B (PDB: 1UK2) are 


(residues 138-145) (Fig. 1A and C). According to the 
reported studies mentioned above, the structural integ- 
rity of the active site appears to be intrinsically 
connected with the presence of an intact dimer interface 
for SARS-CoV 8CL?"®. To address this hypothesis, we 
performed the structure-guided mutagenesis analyses of 
the protease in this study. Totally seven residues on the 
dimer interface were selected, including three residues in 
the N-terminus (Serl, Phe3, Arg4) (Fig. 2A), two residues 
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marked respectively. The labelled residues are shown as sticks, 
and the rest of the proteins as cartoon. Dashes represent the 
hydrogen bonds formed on the dimer interface. The hydrophobic 
interactions between the side-chain phenyl of Phe3 or 
Phe140 and the neighbouring residues are also labelled as the 
surface model. 


in the o-helix A’ of domain I (Ser10, Glul14) (Fig. 2B), 
and two residues of the oxyanion loop near the S1 
subsite in domain II (Ser139, Phe140) (Fig. 2A). These 
residues are mainly involved in dimerization of SARS- 
CoV 3CL?"® through hydrogen bonding and _ highly 
conserved among different CoV 3CL proteases. In the 
following, we evaluated the effects of these residues 
on dimer conformational stability and catalytic activity 
of SARS-CoV 3CL’*° using various biochemical and 
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Table 1. Nucleotide sequences of the primers used for site-directed mutagenesis of SARS-CoV 3CL?*°.* 


Oligonucleotide sequence (5’— 3’) Polarity Mutation introduced 
CATCACGGATCCACCATGGCTGGTTTTAGGAAAATGGC Forward SARS-CoV 3CL?”° SertAla 
GCCATTTTCCTAAAACCAGCCATGGTGGATCCGTGATG Reverse SARS-CoV 3CL?'? SerAla 
CCACCATGAGTGGTGCTAGGAAAATGGCATTCCCG Forward SARS-CoV 3CL?¥° Phe*Ala 
CGGGAATGCCATTTTCCTAGCACCACTCATGGTGG Reverse SARS-CoV 3CL?"° Phe?Ala 
CCACCATGAGTGGTTTTGCGAAAATGGCATTCCCGTC Forward SARS-CoV 3CL?"? Arg*Ala 
GACGGGAATGCCATTTTCGCAAAACCACTCATGGTGG Reverse SARS-CoV 3CL?"? Arg*Ala 
GGAAAATGGCATTCCCGGCAGGCAAAGTTGAAGG Forward SARS-CoV 3CL?"° Ser?°Ala 
CCTTCAACTTTGCCTGCCGGGAATGCCATTTTCC Reverse SARS-CoV 3CL?"? Ser?°Ala 
CGTCAGGCAAAGTTGCAGGGTGCATGGTAC Forward SARS-CoV 3CLP®° Glu*‘Ala 
GTACCATGCACCCTGCAACTTTGCCTGACG Reverse SARS-CoV 3CL?"° Glu'Ala 
CATACCATTAAAGGTGCTTTCCTTAATGGATCATGTGG Forward SARS-CoV 3CL?"° Ser??°Ala 
CCACATGATCCATTAAGGAAAGCACCTTTAATGGTATG Reverse SARS-CoV 3CLP"° Ser??°Ala 
CATACCATTAAAGGTTCTGCCCTTAATGGATCATGTGG Forward SARS-CoV 3CLP"? Phe!*°Ala 
CCACATGATCCATTAAGGGCAGAACCTTTAATGGTATG Reverse SARS-CoV 3CL?"° Phe!*°Ala 


“The mutant codons in the oligonucleotide sequences are highlighted in boldface. SARS-CoV 3CL”"® amino acids are numbered continuously 


from the N-terminal residue, Ser!, to the C-terminal residue, Gln°*™. 


biophysical techniques. It was demonstrated that all seven 
single point mutated proteases can still form the dimer at 
varying concentrations, while the monomer—dimer equili- 
bria of these mutants in solution are different from that of 
the wild type protease. Furthermore the proteolytic 
activities of these mutants decreased to varying extents 
compared with the wild type protease. Although the dimer 
formation of SARS-CoV 3CL?”® could not be disrupted 
completely by single point mutation, individual replace- 
ment of these residues by alanine might partly disrupt the 
integrality of the hydrogen bonding networks on the dimer 
interface, which perhaps induces an altered conformation 
of the substrate-binding pocket, therefore, results in the 
decrease or loss of the enzymatic activity. 


MATERIALS AND METHODS 


Simulation System—lInitial coordinates for SARS-CoV 
3CL?*° dimer was taken from the crystal structure (6) 
(PDB code: 1UK2). The missing residues were repaired 
using the loop search method in the Homology module of 
Insight II. For the simulation of SARS-CoV 3CL?”° dimer 
in aqueous solution, the protein was first put into a 
suitably sized box, of which the minimal distance from 
the protein to the box wall was 1.5nm. Then the box was 
solvated with the SPC water model (17). The protein/ 
water system was submitted to energy minimization. 
Later, counterions were added to the system to provide a 
neutral simulation system. The whole system was 
subsequently minimized again. 

Molecular Dynamics Simulations—Conventional molec- 
ular dynamics (CMD) simulations were carried out using 
the AMBER 7.0 package with NPT and _ periodic 
boundary conditions. The Amber Parm99 force field (18) 
was applied for the proteins. The Particle Mesh Ewald 
(PME) method (19) was employed to calculate the long- 
range electrostatics interactions. The non-bonded cutoff 
was set to 12.0A, and the non-bonded pairs were 
updated every 25 steps. The SHAKE method (20) was 
applied to constrain all covalent bonds involving hydro- 
gen atoms. Each simulation was coupled to a 300K 
thermal bath at 1.0atm pressure by applying the 


algorithm of Berendsen (21). The temperature and 
pressure coupling parameters were set as 0.2ps and 
0.05 ps, respectively. An integration step of 2fs was set 
up for the MD simulations. 

Materials—The restriction and modifying enzymes in 
this work were purchased from NEB. The vector pQE30 
and the bacterial strain M15 were from Qiagen. 
Isopropyl -D-thiogalactoside (IPTG) was purchased 
from Promega. The Ni-chelating column and low molec- 
ular weight marker for SDS-PAGE were purchased from 
Amersham Pharmacia Biotech. All other chemicals were 
of reagent grade or ultra-pure quality, and purchased 
from Sigma. 

Cloning, Expression and Purification of the Wild Type 
SARS-CoV 3CL?”°—The wild type SARS-CoV 3CL?”° was 
prepared according to our published method (22). The 
protease was highly pure according to SDS-PAGE and 
dialyzed to 20mM Tris-HCl pH7.5 containing 100mM 
NaCl, 5mM dithiothreitol (DTT) and 1mM ethylene 
diaminetetraacetic acid (EDTA). The purified protein was 
further confirmed by N-terminal sequencing and mass 
spectrometry, and concentrated by Centriprep (Milipore). 
The protein concentration used in all experiments was 
determined from the absorbance at 280 nm (Aggo) using a 
molar extinction coefficient (¢2g9) for the monomer of 
34,390/M cm (22, 23). 

Site-directed Mutagenesis of the Residues on the Dimer 
Interface of SARS-CoV 3CL?’°—Site-directed mutagen- 
esis of the residues on the dimer interface of SARS-CoV 
3CL?*° was processed by a modified recombinant PCR 
method. Totally seven mutated SARS-CoV 3CL?"°s 
(Ser'Ala, Phe? Ala, Arg*Ala, Ser’Ala. Glu™Ala, 
Ser’*°Ala and Phe’°Ala) were prepared with the 
QuikChange site-directed mutagenesis kit (Stratagene) 
using pQE30-SARS-CoV 3CL?"® as a template. The 
nucleotide sequences of the primers used for mutation 
were given in Table 1. The pQE30-SARS-CoV 3CL?® 
plasmids encoding mutated forms of SARS-CoV 3CL?*° 
were verified by sequencing, and then Escherichia coli 
M15 cells were transformed by the resulting plasmids. 
The mutated proteins were expressed and purified in 
a similar procedure to that for the wild type protease. 
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The purity and structural integrity of the mutated 
proteases were analysed by SDS-PAGE, N-terminal 
sequencing and mass spectrometry. The concentrated 
proteins were stored in 20mM Tris-HCl pH7.5, 100mM 
NaCl, 5mM DTT, 1mM EDTA, at —20°C. 

Circular Dichroism (CD)  Spectroscopy—Circular 
Dichroism (CD) spectra were recorded on a JASCO-810 
spectropolarimeter. The protein sample was prepared in 
20mM sodium phosphate pH 7.5, 100mM NaCl at 25°C 
with concentration of 104M. Far-UV CD spectra from 
190 to 250 nm were collected with 1nm band width using 
0.1cm path length cuvette, and normalized by subtract- 
ing the baseline recorded for the buffer. Each measure- 
ment was repeated thrice and the final result was the 
average of three independent scans. The Far-UV CD 
spectra of the mutated proteases were compared with 
that of the wild type SARS-CoV 38CL°° to exclude the 
possibility of structural misfolding caused by single point 
mutation. 

Fluorescence Spectroscopy—The fluorescence experi- 
ments were performed on a HITACHI F-2500 fluores- 
cence spectrophotometer. The protease sample was 
prepared in 20mM Tris-HCl pH7.5, 100mM NaCl with 
concentration of 54M. The fluorescence emission spectra 
from 300 to 380nm were collected after excitation at 
280nm, and the spectral slit width was 5nm for 
excitation and emission. Fluorescence spectra of the 
wild type and mutated SARS-CoV 3CL?™’s were mea- 
sured in a 1ml quartz cuvette with 1cm path length at 
25°C. All final spectra were corrected for the buffer 
contribution, and were the average of three parallel 
measurements. 

Glutaraldehyde Cross-linking SDS-PAGE—For the 
wild type and mutated SARS-CoV 3CL?"’s (final concen- 
tration from 0.2 to 5mg/ml in 20mM Tris-HCl pH7.5, 
100mM NaCl, 5mM DTT, 1mM EDTA) an aliquot of 
25% (v/v) glutaraldehyde was added to make a final 
concentration of 0.05 or 0.1% glutaraldehyde. The 
samples were incubated at 25°C for 15min followed by 
quenching the reaction with the addition of 1.0M Tris— 
HCl pH8.0 (0.5% v/v). Orthophosphoric acid was there- 
after added into the reaction mixture to result 
in precipitation of the cross-linked proteins. After 
centrifugation (12,000r.p.m., 4°C), the precipitate was 
re-dissolved in loading buffer and heated at 100°C for 
5min. SDS-PAGE was run with 10% gels. 

Size-exclusion Chromatography (SEC) Analysis—The 
dimer—monomer equilibria of the wild type and mutated 
SARS-CoV 3CL°"°s were analysed by size-exclusion 
chromatography (SEC) on a HiLoad 16/60 Superdex 75 
prep grade column through an AKTA FPLC system 
(Amersham Biosciences). Buffer used was 20mM Tris— 
HCl pH7.5, 100mM NaCl, 5mM DTT and 1mM EDTA. 
The buffer was degassed and the column was equili- 
brated with the buffer before injecting protein samples. 
Protein samples with a concentration of 5mg/ml were 
loaded on the column and then eluted with the buffer at 
a flow rate of 1ml/min by detection of absorbance at 
280nm. The integrated area values of absorbance peaks 
were calibrated by AKTA FPLC evaluation software. The 
column was calibrated using a low molecular mass gel 
filtration kit (Amersham Biosciences) with four marker 
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Table 2. Potential residue-residue interactions on the 
dimer interface of SARS-CoV 3CL?*° predicted by 5-ns 
molecular dynamics simulations.* 


Hydrogen Bond Time occupancy (%) 


A B 
i Ser10 (OG) Ser10 (OG) 99.1 
2 Gly11(N) Glul14 (OE1/OE2) 100 
3 Glul4 (OE1/OE2) Gly11(N) 100 
4 Arg4(NH1/NH2) Glu290(OE1/OE2) 95.7 
5 Glu290(0E1/OE2) Arg4(NH1/NH2) 96.9 
6 Ser139(0/OG) Gly2(N) 99.2 
7 Gly2(N) Ser139(0/OG) 30.3 
8 Phe3(N) Ser139(0/OG) 88.2 
9 Ser139(0/OG) Phe3(N) 43.5 
10 Ser1(OG) Glu166(OE1/OE2) 98.3 
11 Glu166(OE1/OE2) Ser1(OG) 55.4 
12 Phel140(N) Ser1(0/OG) 93.0 
13 Ser1(0/OG) Phe140(N) 5.1 


“A, monomer A; B, monomer B. 


proteins: Ribonuclease A (13.7kDa), Chymotrypsinogen 
A (25.0kDa), Ovalbumin (48.0kDa) and Albumin 
(67.0 kDa). 

Enzymatic Activity Assay—The catalytic activities of 
the wild type and mutated SARS-CoV 3CL?*°s were 
measured by FRET-based assays using a 12-amino 
acid fluorogenic substrate, EDANS-VNSTLQSGLRK 
(Dabcyl)-M, according to our published studies (23, 24). 
During the continuously kinetic assay, the protease (final 
concentration 11M) was pre-incubated for 30 min at 25°C 
with the assay buffer (20mM Tris-HCl pH7.5, 100mM 
NaCl, 5mM DTT and 1mM EDTA), followed by the 
addition of the fluorogenic substrate (final concentration 
10uM). The fluorescence intensity was monitored on a 
GENios microplate reader (TECAN, Mannedorf, 
Switzerland) and the instrument was first set to zero 
with the fluorogenic substrate. Cleavage of the substrate 
as a function of time was measured by the increase in 
emission fluorescence intensity upon continuous monitor- 
ing of reactions in a 96-well black microplate (BMG 
LABTECH, Offenburg, Germany) using wavelengths of 
340nm and 488nm for excitation and emission, respec- 
tively. The incubation of the substrate in the assay buffer 
without the protease was also performed as a control. 
Enzymatic activity was the average of three parallel 
assays and the activity of the wild type SARS-CoV 
3CL°”° was taken as 100%. 


RESULTS 


Preparation of the Seven Mutated Proteases Involved in 
the Dimer Interface—To predict the key factors that 
maintain the stability of the dimer interface, 5-ns CMD 
simulations were firstly conducted on the dimer of SARS- 
CoV 3CL?”*. All interactive residues between monomer A 
and B, as shown in Fig. 1C, were monitored for the time 
occupancy during the whole simulation process. The 
hydrogen bonds formed on the dimer interface were 
calculated by using HPLUS (25). Interestingly, more 
than 10 hydrogen bonding interactions occupy most time 
of simulation (Table 2), suggesting that the residues 
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involved in these interactions might possibly make well 
contributions to keep the dimer conformational stability. 
Guided by this potential information, we selected seven 
residues on the dimer interface for site-directed muta- 
genesis (Fig. 1A). These residues are mainly involved in 
the dimerization of SARS-CoV 3CL?"® through hydrogen 
bonding and hydrophobic interactions with their side- 
chain or main-chain groups (Fig. 2), and single Ala 
substitution might perturb the entirety of the hydrogen 
bonding networks on the dimer interface. 

According to the preparation strategy previously 
reported in our lab (22), we expressed SARS-CoV 
3CL?° as an N-terminal His-tagged protein for purifica- 
tion convenience. Considering the results shown in 
several other publications (10, 15, 26, 27) that 
N-terminal extra amino acids, eg. the purification 
affinity tag, might interfere with dimerization of SARS- 
CoV 3CL?*®, we also constructed the protease into a 
vector without affinity tag and evaluated the dimeriza- 
tion feature of the un-tagged protein. While there are no 
obvious differences observed for the dimer—monomer 
equilibrium in solution between the two purified pro- 
teases (data not shown), thus we performed all subse- 
quent assays with the N-terminal His-tagged 3CL?"°. 

Similar with the wild type protease, all the seven 
single point mutants were also successfully cloned and 
expressed in E£. coli M15 cells. The majority of the 
proteins could be obtained in the soluble fraction of 
the cell lysate. SDS-PAGE analyses indicated that all 
mutated proteases are highly homogeneous in solution. 
Although the corresponding protein bands in SDS-PAGE 
would shift little faster than the molar marker of 
35.0kDa, the recombinant proteins have been clearly 
identified as SARS-CoV 3CL?*° with a molecular mass of 
35.8kDa by mass spectrometric characterization (data 
not shown), in agreement with the values calculated from 
the protein sequences and the published data from our 
laboratory (22). 

Figure 3 shows the Far-UV CD spectra of the wild type 
and seven mutants of SARS-CoV 3CL?"®. The spectra of 
the seven mutated proteases seem to be similar to that of 
the wild type SARS-CoV 8CL?**. All spectra give a 
positive peak at 196nm and dual negative peaks at 209 
and 222nm, typical of a mixture of «-helical and B-sheet 
structures. These results indicated that all seven 
mutated proteases have well-defined secondary struc- 
tures and excluded the possibility of structural misfold- 
ing caused by single residue mutation. However small 
changes of the CD spectra do exist as shown in Fig. 3, 
which might be due to minor structural changes induced 
by Ala mutations. 

The fluorescence emission spectra of the wild type and 
seven mutants of SARS-CoV 3CL?® are also shown in 
Fig. 4. The emission Amax of the wild type SARS-CoV 
3CL?”° is 325nm. Similar to the wild type protease, all 
seven mutated proteins show only minor difference on 
the emission Amax (varying from 324nm to 327nm), 
further demonstrating that replacement of single residue 
on the dimer interface by Ala has not changed the folding 
manner of the protease. 

Chemical Cross-linking Analyses of the Seven Mutated 
Proteases—Similar to 3CL_ proteases of human 
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Fig. 3. CD spectra of the wild type and site-directed 
mutants for SARS-CoV 3CL?*"’. Far-UV CD spectra of the 
wild type and seven mutated SARS-CoV 3CLP’s at 25°C. 
Protein concentrations used in CD experiments were 101M 
and all protein samples were prepared in 20mM sodium 
phosphate pH7.5, 100mM NaCl. The CD spectrum of the wild 
type protease is shown in black and the spectra of the mutated 
proteases are shown in light gray. 
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Fig. 4. Fluorescence emission spectra of the wild type and 
site-directed mutants for SARS-CoV 3CLP*°. Fluorescence 
emission spectra of the wild type and seven mutated proteases 
were recorded at 25°C after excitation at 280nm. The protease 
samples (54M) were prepared in 20mM Tris-HCl pH7.5, 
100mM NaCl. The spectrum of the wild type protease is 
shown in black and those of the mutants are shown in light 
gray. 


coronavirus (HCoV) 229E and transmissible gastroenter- 
itis coronavirus (TGEV) (28), SARS-CoV 3CL?*° can form 
a dimer in the crystal structure and solution (6, 13). The 
dimerization features of SARS-CoV 3CL?*° have been 
successfully characterized by various biochemical and 
biophysical methods (7-9, 12). According to the published 
method (29), we first performed the chemical cross- 
linking analysis of the wild type SARS-CoV 3CL?™. 
When incubated with 0.05% glutaraldehyde, the protease 
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Fig. 5. SDS-PAGE profiles of glutaraldehyde cross-linked 
SARS-CoV 3CL?P"°s. (A) Cross-linking analysis of the wild type 
SARS-CoV 38CLP"°’. (B-H) Cross-linking analyses of Ser1_Ala, 
Phe3_Ala, Arg4_Ala, Ser10_Ala, Glu14_Ala, Ser139_Ala and 
Phel40_Ala mutants, respectively. Lane 1, untreated 3CL?*° 
(5mg/ml); lane 2, molecular weight protein standards; lane 3a, 


at a concentration of 0.2mg/ml displayed a form of 
monomer near 35.0kDa with the other band correspond- 
ing to the dimer (Fig. 5A, lane 6b). With protein 
concentration increasing, both of the dimeric and mono- 
meric forms increased (Fig. 5A, lanes 5b—3b). A similar 
cross-linking pattern of the protease was observed when 
using a higher concentration of glutaraldehyde (0.1%), 
excluding the possibility of obvious artificial cross-linking 
effects (Fig. 5A, lanes 6a—3a). These results indicate that 
the wild type protease exists as a mixture of monomer 
and dimer at varying concentrations, which is consistent 
with the reported studies (7, 8). 

To preliminarily examine the effects of Ala mutations 
of the selected seven residues on dimerization of SARS- 
CoV 3CL?°, the chemical cross-linking analyses of the 
mutated proteases were also performed, respectively. 
Conclusively all seven mutated proteases displayed 
similar cross-linking patterns with the wild type protease 
(Fig. 5B—H). The dimeric form of each mutated protease 
also existed within a wide range of protein concentra- 
tions, suggesting that mutation of single residue on the 
dimer interface could not completely abolish the dimeric 
structure of SARS-CoV 3CL?*° in solution. However 
moderate differences of dimer-monomer equilibria do 
exist among these mutants. For the Serl0_Ala and 
Glu14_Ala mutants (Fig. 5E and F), the amount of the 
dimer was relatively low compared with the wild type 
and other mutated proteases, indicating that the «-helix 
A’ of domain I might be an important part of the dimer 
interface and relatively contribute more to maintain the 
dimer stability of SARS-CoV 3CL?"®. In addition, we 
should note that the possibility of minor artificial cross- 
linking effects might still exist due to the appearance of 
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3CL?*° (5mg/ml) cross-linked by 0.1% glutaraldehyde; lane 3b, 
3CL?*° (5 mg/ml) cross-linked by 0.05% glutaraldehyde; lanes 4a 
and 4b, 3CL*° (1mg/ml), 0.1 and 0.05% glutaraldehyde; lanes 5a 
and 5b, 3CLP*° (0.5 mg/ml), 0.1 and 0.05% glutaraldehyde; lanes 
6a and 6b, 3CLP*° (0.2 mg/ml), 0.1 and 0.05% glutaraldehyde. 


the high-order multimers in SDS-PAGE (Fig. 5). While, 
considering the chemical cross-linking analyses of the 
wild type and mutant proteases were performed under 
exactly same experimental procedures, these analyses 
might still be convincing to preliminarily examine the 
effects of these mutations on dimerization of SARS-CoV 
3CLP°. 

SEC Analyses of the Seven Mutated Proteases—In 
order to more exactly evaluate the perturbation of 
dimer—monomer equilibrium caused by these mutations, 
we performed SEC analyses to further characterize the 
wild type and mutated SARS-CoV 3CL?*°s. We used a 
protein concentration at 5mg/ml for each run, which 
represents the highest concentration used in the cross- 
linking experiments, and the physical states correspond- 
ing to native monomeric and dimeric protease were 
observed. As shown in Fig. 6A, the wild type SARS-CoV 
3CL?P”° elutes in two peaks with the retention volumes at 
44.6 and 62.1ml. The elution profiles of four molecular 
mass marker proteins confirmed that the first peak 
might correspond to the dimer state (71.6kDa) and the 
second peak would represent the monomeric species of 
SARS-CoV 3CLP® (35.8kDa), in well agreement with 
the reported result (8). We also collected the fractions 
representing these two elution peaks and analysed them 
by SDS-PAGE, and the corresponding protein bands 
further indicated that both of these two peaks are SARS- 
CoV 3CL?° (data not shown). The amount of the dimer 
and monomer could be further quantified by the 
integrated area values of these two peaks and the 
dimer/monomer ratio of the wild type protease was 
estimated as 1.02 (Table 3). This observation thus 
indicates that in solution the wild type protease exhibits 
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Fig. 6. Dimer-monomer equilibria of the wild type and 
site-directed mutants for SARS-CoV 3CL?*° analysed by 
SEC. (A) Elution profile of the wild type SARS-CoV 3CL?” at 
neutral pH (7.5) and a concentration of 5mg/ml; (B-(H) Elution 
profiles of Serl_Ala, Phe3_Ala, Arg4 Ala, Ser10_Ala, Glu14_Ala, 
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Ser139_Ala and Phe140_Ala mutants at concentrations of 5 mg/ 
ml, respectively. Elution profiles of four marker proteins are also 
shown in arrow labels. Each protein sample was loaded to a 
HiLoad 16/60 Superdex 75 prep grade column and then eluted at 
a flow rate of 1ml/min with detection of absorbance at 280nm. 
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Table 3. Elution profiles of the wild type and site-directed mutants for SARS-CoV 3CL?*® in solution by SEC analyses. 


Protein Elution peak?™* (ml) 


Elution peak ™°7°™* (ml) 


Dimer (%)* D (%)/M (%) 


Monomer (%)* 


WT SARS-CoV 3CL?"° 44.6 
SARS-CoV 3CL"° Ser'Ala 43.4 
SARS-CoV 3CL?”° Phe?Ala 44.8 
SARS-CoV 3CL?”° Arg*Ala 44.8 
SARS-CoV 3CLP"° Ser?°Ala 45.8 
SARS-CoV 3CL?"° Glu*Ala 44.4 
SARS-CoV 38CLP'° Ser1°°Ala 43.8 
SARS-CoV 3CL?° Phe!*°Ala 44.2 


62.1 50.5 49.5 1.02 
62.8 51.9 48.1 1.08 
63.4 48.2 51.8 0.93 
60.4 31.1 68.9 0.45 
60.2 39.8 60.2 0.66 
62.4 26.5 73.5 0.36 
59.2 44.8 55.2 0.81 
60.6 38.7 61.3 0.63 


“The percentage of dimers (D) and monomers (M) was estimated by deconvolution of the corresponding SEC elution profiles. 


both forms of monomer and dimer and the amount of the 
monomer is almost equal to that of the dimeric form, in 
well agreement with the chemical cross-linking analysis 
and literature report (13). 

For the seven mutated proteases under identical 
conditions, the two elution-peaks representing the 
dimer and monomer states were also monitored, respec- 
tively (Fig. 6B-H). The results demonstrate that dimer- 
ization of SARS-CoV 3CL?” could not be disrupted 
entirely by mutation of single residue on the dimer 
interface, further supporting the chemical cross-linking 
results. Compared with the wild type protease, these 
mutants showed minor drifts on the retention volumes of 
the two elution peaks (Table 3, varying from 43.4ml to 
45.8 ml and from 59.2 ml to 63.4 ml), indicative of possible 
subtle conformational changes of the dimer and monomer 
structures. Furthermore, the dimer/monomer ratios of 
these mutants differentiated significantly from each 
other (Table 3), implying that the contributions of these 
residues to the monomer-—dimer equilibrium of SARS- 
CoV 3CL?*® are quite different. For the Ser1_Ala, 
Phe3_Ala and Ser139_Ala mutants, the ratios between 
the dimers and monomers were 1.08, 0.93 and 0.81, 
respectively, which indicates that these three residues 
could only affect the dimer interface stability to a lesser 
extent. For the other mutants, especially the Arg4 Ala 
and Glul4_Ala mutants, the dimer/monomer ratios 
decreased obviously and were nearly 2 to 3-fold lower 
than that of the wild type protease, suggesting that the 
amount of the dimer has decreased and the monomer is 
the predominant form. Overall, Glu14, Arg4, Phe140 and 
Serl0 (in decreasing order) on the dimer interface are 
the relatively more critical residues for stabilizing the 
dimeric structure of SARS-CoV 3CL?”°. 

Enzymatic Activity Assays of the Seven Mutated 
Proteases—Several published results have proposed that 
only the dimer should be the biological functional form of 
SARS-CoV 3CL?”° and the dissociated monomer might be 
enzymatic inactive (13, 30). Meanwhile, alteration of the 
correct conformation of the dimeric structure could also 
lead to a complete loss of the catalytic activity (8, 14). 
Although Ala replacement of single residue on the dimer 
interface could not completely result in the dimer 
dissociation in solution, the seven residues we selected 
still might affect the catalytic activity of SARS-CoV 
3CL?’° considering their contributions to stabilize the 
monomer—monomer interface. To verify this prediction, 
we determined the enzymatic activities of the wild type 
and seven mutated proteases by a fluorogenic substrate 
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Fig. 7. Fluorescence profiles of hydrolysis of the fluoro- 
genic substrate by the wild type and _ site-directed 
mutants for SARS-CoV 3CL?*°. The fluorogenic substrate at 
a concentration of 10.M was incubated with 14M wild type or 
mutated SARS-CoV 3CL?*° in 20mM Tris-HCl pH7.5, 100mM 
NaCl, 5mM DTT, 1mM EDTA, at 25°C. Increase of emission 
fluorescence intensity at 488nm wavelength was recorded at 
10min intervals, Agx=340nm. The emission spectrum was 
recorded for 90min and the activity of the wild type protease 
was taken as 100%. 


reported previously in our lab (23, 24). The catalytic 
activity of SARS-CoV 3CL?"® and relevant inhibitors 
screening have been characterized extensively by the 
FRET-based assay (26, 31-33). As shown in Fig. 7, the 
fluorescence increase following hydrolysis of the sub- 
strate by the wild type SARS-CoV 3CL?”® is significant 
and time-dependent, implying that the protease could 
hydrolyze the substrate efficiently. As expected, the 
fluorescence profiles of the seven mutants were obviously 
different from that of the wild type protease (Fig. 7), 
which indicates that mutation of these residues could 
inactivate the catalytic activity of SARS-CoV 3CL°° to 
varying extents. In detail, mutation of residues Ser10 
and Phe1l40 almost produced the complete loss of the 
enzymatic activity and the catalytic activities of the 
Phe3_Ala, Glu14_Ala and Arg4 Ala mutants were also 
decreased to only 4-10% of that of the wild type protease 
(Table 4). While the mutants of Serl_Ala and Ser139_Ala 
still possessed 46 and 58% of enzymatic activity, 
respectively. (Table 4). These results further support 
the conclusions derived from the SEC analyses that the 
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Table 4. Enzymatic activities of the wild type and site- 
directed mutants for SARS-CoV 3CL?"°.* 


Protein Proteolytic activity (%) 
WT SARS-CoV 3CLP° 100 
SARS-CoV 3CLF"? SertAla 46 
SARS-CoV 3CL?”? Phe*Ala 5 
SARS-CoV 3CL?"? Arg*Ala 10 
SARS-CoV 3CLP"? Ser?°Ala <1 
SARS-CoV 3CL?° Glu™Ala 4 
SARS-CoV 3CLP"? Ser1?°Ala 58 
SARS-CoV 3CL?° Phe'*°Ala <1 


“Enzymatic activities were averages determined in three parallel 
experiments. The activity of wild type SARS-CoV 3CL?™ was taken 
as 100%. 


extensive monomer—monomer interactions regulated by 
these residues could stabilize the dimeric structure at 
different degrees. However, it is noticeable that the 
influence of these mutations on the catalytic activity is 
more obvious than that on dimerization of SARS-CoV 
38CLP’°, which will be discussed below. 


DISCUSSION 


Proteolytic processing of the non-structural polyproteins 
is a vital step in the replication cycle of coronavirus, and 
such processing is commonly performed by virus-genome 
encoded proteases including 3CL?”® (4, 34-36). Therefore 
3CL?*° has been appreciated as an attractive target in 
discovering anti-coronavirus agents (5). SARS-CoV 
3CLP” shares high homology with the 3CL°*°s of other 
coronaviruses, and the 3D structures of different coro- 
navirus 3CL?”’s are more conserved than their sequences 
(6, 37). SARS-CoV 3CL°° has been extensively charac- 
terized for its structural property and enzymatic activity 
(8, 9, 12-15, 27, 38, 39). The protease can form a 
homodimer in crystal and solution (Fig. 1A and B), and 
the dimeric structure is proposed to be indispensable for 
enzymatic activity. Much progress has been made for 
understanding the correlation between dimerization and 
catalytic activity of SARS-CoV 3CL?”° (7-10, 12, 27, 30). 
Recently a systematic mutagenesis study reported an 
initial attempt to map the dimerization interface on the 
helical domain III of the protease (11). 

In the present study, we focused another seven 
residues on the dimer interface of SARS-CoV 3CL?” for 
single point mutagenesis (Fig. 1A). These selected 
residues are predicted to involve in dimerization mainly 
through hydrogen bonding (Table 2 and Fig. 2). 
Structurally, the seven mutated proteases could be 
divided into three groups. The first group includes 
Serl_Ala, Phe3_Ala and Arg4 Ala mutants regarding 
three residues on the N-terminal finger of SARS-CoV 
3CL?"°. The N-terminal finger of one monomer can form 
intensive interactions with domain II of the other 
monomer (6) (Fig. 2A), e.g. the NH group of Serl in 
monomer B donates hydrogen bonds to the main-chain 
carbonyl of Phe140 in monomer A, as well as the side- 
chain carboxylate of Glul66A; and the side-chain OH 
group of Ser1B forms a hydrogen bond with the main- 
chain NH group of Phe140A. In addition, the NH group 
of Phe3B donates a hydrogen bond to the side-chain OH 


S. Chen et al. 


group of Ser139A. This pair of hydrogen bond might be 
stabilized by hydrophobic interactions between the side- 
chain phenyl of Phe3B and the neighbouring residues, 
e.g. Leu282B and Phe291B. Whereas replacement of 
residue Serl or Phe3 by Ala rendered little influence on 
the dimer—monomer equilibrium of SARS-CoV 3CL?*° 
(Fig. 5B, C and Table 3), indicating that these two 
residues might not play a vital role in dimerization. The 
results are in agreement with a reported study that the 
N-terminal residues1—3 truncated protease still exhibits 
a tendency to form dimer (9). The Serl_Ala mutant 
maintained 46% of enzymatic activity and the activity of 
the Phe3_Ala mutant was nearly 10-fold lower than that 
of the wild type protease (Table 4), implying that in 
addition to dimerization, residues Serl and Phe3 could 
regulate the catalytic activity of the protease by other 
mechanisms. According to the crystal structure (Fig. 2A), 
the monomer—monomer interactions mediated by Serl 
and Phe3 might be helpful to maintain the correct 
catalytic conformation of the S1 subsite, and mutation of 
Serl or Phe’ possibly induces an altered uncompetitive 
conformation of the S1 subsite. Nevertheless, this 
hypothesis should be verified by the crystal structures 
determination of these two mutated proteases. Besides 
Serl and Phe3, another residue Arg4 was also selected 
for mutagenesis study. The side-chain guanidyl of Arg4 
in one monomer forms a salt bridge with the side-chain 
of Glu290 in the other monomer (Fig. 2A), which has 
been reported as one of the major interactions between 
the two monomers (12). Here, Arg4_Ala mutant was 
shown to a tendency to monomer state (Table 3) and a 
very weak enzymatic activity (Table 4), further demon- 
strating the importance of the Arg4-mediated interac- 
tions in the quaternary structure and activity of the 
protease. Although the role of the N-terminal finger has 
been assessed by many investigations (9, 10), our results 
revealed that the residues on the N-terminal finger 
indeed contribute differently to the dimer stability and 
catalytic activity of SARS-CoV 3CL?°. 

The second group of the mutants is about Ser10_Ala 
and Glu14_Ala, which are related to two residues on the 
a-helix A’ of domain I of SARS-CoV 3CL?”. The two 
residues are highly conserved among different corona- 
virus 3CL proteases and also extensively involved in 
monomer—monomer interactions (Fig. 1C). Residue Ser10 
from each monomer can form a pair of hydrogen bond 
between the main-chain NH group and the side-chain 
OH group, and the side-chain carboxylate of Glu14 in one 
monomer donates a hydrogen bond with the main-chain 
NH group of Gly11 in the other monomer (Fig. 2B). In 
the present study, both of the two mutants were shown 
to weak dimerization (Fig. 5E-F and Table 3) and have 
no detectable enzyme activity either (Table 4), indicating 
that the a-helix A’ of domain I might also be a critical 
region for dimerization. Structurally, the a-helix A’ 
(residues Serl0-Gly15) connects to the N-terminal 
finger of SARS-CoV 3CL?*° and might determine the 
correct spatial orientation of the N-terminal finger 
(Fig. 2B). In the dimer structure, the N-terminal finger 
can squeeze into the space between domain III of its 
parent monomer and domain II of the neighbouring 
monomer, which is indispensable for maintaining the 
correct catalytic conformation of the protease (6, 8). 
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Mutation of Ser10 or Glul4 to Ala is possible to partly 
disrupt the structure of the o-helix A’ and produce a mis- 
oriented N-terminal finger, thus making the protease 
completely inactive. However, this conclusion should be 
further confirmed by the crystal structures of the 
corresponding mutated proteases (unpublished data 
from this laboratory). 

In addition, we also performed another group of 
mutants including Ser139_Ala and Phel40_Ala. These 
two residues are located in the oxyanion loop (residues 
138-145) of domain II and involved in the dimer 
interface by interacting with the N-terminus residues of 
the other monomer (Fig. 2A). The oxyanion loop is 
associated with the formation of the S1 subsite in the 
substrate-binding pocket, which determines the absolute 
specificity of SARS-CoV 3CL?”° for Glu in the P1 position 
of the substrate (6, 14). The oxyanion loop is very flexible 
and a rearrangement of its correct conformation could 
induce the collapse of the oxyanion hole (Gly143-Ser144- 
Cys145) in the S1 subsite, therefore, inactivate the 
protease completely (14). The Serl39_Ala mutant 
showed only a minor difference in the monomer—dimer 
equilibrium with the wild type protease (Table 3), 
implying that the contribution of residue Serl39 to 
dimerization is not dominant. While the Ser139_Ala 
mutant preserved only 50% of the wild type activity 
(Table 4), which indicates that mutation of Ser139 might 
directly affect the catalysis, most probably by altering the 
conformation of the oxyanion loop. Although Phel40 
donated hydrogen bonds to Serl through its main-chain 
groups, the Phe140_Ala mutant still had an obvious trend 
to the monomer state (Table 3). It is possible that the 
hydrophobic packing between the side-chain phenyl of 
Phel40 and the residues nearby (e.g. His163, His172) 
would also stabilize the Phel40-Serl interactions 
(Fig. 2A). Meanwhile, mutation of Phel40 completely 
abolished the proteolytic activity of SARS-CoV 3CL?”° 
(Table 4), further suggesting that the interactions 
mediated by Phe140 might also contribute well to maintain 
the conformational stability of the S1 subsite (14). 

In summary, our study characterized the contributions 
of several previously unidentified residues to the dimer 
stability and catalytic activity of SARS-CoV 38CL?”*. Since 
the dimeric structure has been proved to be indispen- 
sable for enzymatic activity of SARS-CoV 3CL?”®, it is 
easy to understand the conclusion of no dimer, no 
activity. In this study, some residues have been revealed 
to be important for both dimerization and _ activity. 
Meanwhile, SARS-CoV 3CL?"® is a very flexible protein 
and the correct conformation state might also be vital 
for the protease to maintain its full activity. Thus, in 
addition to dimer dissociation, an altered conformation of 
the substrate-binding pocket possibly induced by single 
mutation on the dimer interface could also make the 
protease inactive. Our future study should be focused on 
determining the crystal structures of these mutated 
proteases, which will shed more light on understanding 
the dimerization-activity relationship of SARS-CoV 
3CLP”°. 
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